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Abstract 

We show that the Kth largest subset problem and the Kth largest 
m-TUPLE problem are in PP and hard for PP under polynomial-time Turing 
reductions. Several problems from the literature were previously shown NP- 
hard via reductions from those two problems, and by our main result they 
become PP-hard as well. We also provide complementary PP-upper bounds 
for some of them. 
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1. Introduction 

The following two problems are listed in Garey and Johnson’s classical 
compendium on the theory of NP-completeness [H, p. 225] as problems [SP20] 
and [SP21], respectively: 

Ktr largest subset problem 

INSTANCE: A set A = {xi,..., C N and K,B eN. 

QUESTION: Is # (T C A : x < B} > K f 

Kth largest m-TUPLE problem 

INSTANCE: Finite sets Ai,..., A^ C N and A, 5 e N. 

QUESTION: Is # {(xi,..., x^) e W x ■ ■ ■ A^ : J2T=i ^^>B}>K1 
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The first problem was initially introduced by Johnson and Kashdan and 
the second by Johnson and Mizoguchi [^. Garey and Johnson [l| state that 
both problems are NP-hard under polynomial-time Turing reductions but 
mention that their membership in NP, and, thus, their NP-completeness, is 
open. For the NP-hardness proof of the JFth largest subset problem, 
see Theorem 6 in the paper of Johnson and Kashdan j^, or the discussion 
provided by Garey and Johnson [l], p. 115]. It is unknown whether this 
problem is also NP-hard under polynomial-time many-one reductions Bp- 
148]. The iPTH LARGEST SUBSET problem has its name for historical reasons: 
note that the the problem is about the Ki\i smallest subset. 

Both problems have repeatedly been used as starting points for prov¬ 
ing other problems NP-hard. In this note, we show that the JFth largest 
SUBSET and the iPTH largest m-TUPLE problem are complete for the com¬ 
plexity class PP. More precisely, they are in PP, and they are hard for PP 
under polynomial-time Turing reductions. The complexity class PP B can 
be dehned as the class of languages L C S* that have an NP Turing ma¬ 
chine Ml such that for all words tc G S* one has tc G L if and only if more 
than half of the computation paths of Mi on w are accepting. The class PP 
includes NP B- Glosely related is the function class #P, which consists 
of those functions / : S* —)• N for which there exists an NP Turing ma¬ 
chine Mf such that for all words w & Ti* the function value f{w) is equal 
to the number of accepting computation paths of Mf on w. The class PP 
is remarkably powerful: Toda’s Theorem B states that P^^ (which equals 
P^^) contains the polynomial-time hierarchy PH. This means, in particular, 
that if the JFth largest subset (or m-TUPLE, resp.) problem were in NP 
then the polynomial-time hierarchy would collapse to P^^. This can be taken 
as evidence (under standard assumptions from complexity theory) that the 
problems considered in this note are not in NP and, hence, are harder than 
NP. 

By an application of the main result of this note, it follows that sev¬ 
eral other problems for which reductions from the iPTH LARGEST SUBSET 
(or m-TUPLE) problem have been constructed are not only NP-hard under 
polynomial-time Turing reductions, but in fact PP-hard under polynomial¬ 
time Turing reductions. We give examples of such problems in Section [3l 
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2. Main Result 


The class PP can equivalently be characterised as the class of lan¬ 
guages L C E* that have a polynomial-time bounded deterministic Turing 
machine M with access to a fair coin such that for all w ^ Tj* we have w & L 
if and only if M accepts w with probability at least 1/2. This follows from 
Theorem 4.4 of Simon [^, and in particular from the fact that PP is closed 
under complement , see also the work of Gill . However, in some proofs 
it will be more convenient if the Turing machine has access to biased rather 
than fair coins, and if the biases (rational numbers between 0 and 1) can 
depend on the input. More precisely, a probabilistic polynomial-time bounded 
Turing machine (PP-TM) is a polynomial-time bounded Turing machine 
that takes an input w and then operates in two phases: 


1. Depending only on w, it deterministically computes finitely many ra¬ 
tional numbers bi,... ,bk with bi G (0,1) for alH G {1,..., k}. Each bt 
is represented as a quotient of integers written in binary. We view the bt 
values as biases of biased coins. 

2. It acts like a nondeterministic polynomial-time bounded Turing ma¬ 
chine, but the nondeterminism is resolved probabilistically using biased 
coins with biases bi,... ,bk that were computed in the hrst phase. More 
concretely, besides deterministic instructions, the Turing machine has 
instructions of the form 


if coin(6j) then goto cl else goto c2 

where bi is a bias computed in the first phase. The semantics of this 
operation is the natural one: with probability bi the Turing machine 
continues at cl, otherwise (i.e., with probability 1 — bi) it continues 
at c2. 

We have the following lemma. 

Lemma 1. Let L C S* and M be a PP-TM. Furthermore, let r : S* —)■ 
(0,1) be a polynomial-time computable function returning a rational number 
encoded as a guotient of integers in binary such that for all w E 'E* we 
have w E L if and only if M accepts w with probability at least t{w). Then 
L E PP. 


Proof. The proof consists of three steps: 
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1. We show that we can, without loss of generality, take t{w) = 1/2 for 
all w e S*. 

2. We construct from M an NP Turing machine M' such that for all 
tc G E* we have tc G T if and only if at least half of the computation 
paths of M' on w are accepting. 

3. We argue that the existence of M' implies membership of L in PP. 

In the hrst step of the proof, we show that we can, without loss of generality, 
take t{w) = 1/2 for all w G E*. Indeed, given M and r, construct a PP- 
TM M' as follows. Given w G E*, the PP-TM M' hrst computes r := t{w). 
If r < 1/2, it computes p := (1/2 — r)/(I — r), and accepts immediately 
with probability p. Otherwise (i.e., with probability 1 — p) the PP-TM M' 
simulates M. It follows that 

Pr(M' accepts w) = p + {1 — p) ■ Pr(M accepts w). 

A simple calculation shows that Pr(M accepts w) > t{w) if and only if 
Pr(M' accepts w) > 1/2. 

Similarly, if r > 1/2, the PP-TM M' computes p := l/(2r), and rejects 
immediately with probability 1 —p. Otherwise (i.e., with probability p) the 
PP-TM M' simulates M. It follows that 

Pr(M' accepts w) = p ■ Pr(M accepts w). 

Again, we have Pr(M accepts w) > t{w) if and only if Pr(M' accepts w) > 
1 / 2 . 

Consequently, for the remainder of the proof we can assume that for all 
tc G E* we have w E L if and only if M accepts w with probability at 
least 1/2. Furthermore, we can ensure that for any tc G E* all computation 
paths of M on tc involve the same number, say p^,, of accesses to a coin. 

In the second step of the proof, we construct from M an NP Turing 
machine M' such that for all w E T,* we have w E L if and only if at least 
half of the computation paths of M' on w are accepting. Given a word 
w E Tf*, the Turing machine M' computes, like the PP-TM M, the set B 
of biases. Then M' computes a multiple, say Qw, of the denominators of the 
biases in B. Subsequently, M' simulates M, except that when it hits an 
instruction 

if coin(6) then goto cl else goto c2 

then M' takes b ■ nondeterministic branches to cl and (1 — &) ■ non- 
deterministic branches to c2. By the dehnition of q^ the numbers b ■ q^, 
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and (1 — 6 ) • are natural numbers. They might be of exponential mag¬ 
nitude, but their representation in binary is of polynomial length. Using 
this representation, the Turing machine M' can implement the b ■ and 
{1 — b) ■ qy^ nondeterministic branches using cascades of binary nondetermin- 
istic branches. By this construction, we have for any w G E* that the total 
number of computation paths of M' on w equals {qwY'^ and, furthermore, 
if M accepts w with probability x then x ■ {qwY'^ computation paths of M' 
on w are accepting. We conclude that we have tc G T if and only if at least 
half of the computation paths of M' on w are accepting. 

For the third and hnal step of the proof, construct an NP Turing ma¬ 
chine M” from M' by swapping accepting and rejecting states. Then we 
have w ^ L ii and only if more than half of the computation paths of M" 
on w are accepting. Hence, using the dehnition of PP, the complement of L 
is in PP. But PP is closed under complement, see j^, thus L G PP. □ 

In order to show PP-hardness of the iPTH largest subset and the 
Kth largest m-TUPLE problem, we employ ^^P-completeness of a variant 
of the classical SubsetSum problem. Given a set X = {xi,... ,Xm} C hj 
and T G N, t^SubsetSum asks for determining the number of subsets Y of 
X whose elements sum up to T, i.e., computing 



Building upon the results of Hunt et al. j^, Faliszewski and Hemaspaandra 

p. 104] derive that #SubsetSum is #P-complete, where #P-hardness 
holds under parsimonious many-one reductions. As dehned by Simon j^, 
given functions /, (7 : E* —)■ N we say that / parsimoniously reduces to g 
if there is a polynomial-time computable function h : E* —)■ E* such that 
f{w) = g{h{w)) for all w G E*. For subsequent reference, let us capture the 
result of Faliszewski and HemaspaandraOO by the following proposition. 

Proposition 2. t^SubsetSum is ffP-complete, and in particular ffP-hard 
under parsimonious many-one reductions. 

We are now prepared to prove our main result. In the remainder of 
this note, whenever we refer to PP-completeness, PP-hardness holds under 
polynomial-time Turing reductions. In the statements of the theorems we 
make this explicit to avoid a confusion of casual readers. 
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Theorem 3. The i^TH largest subset problem and the Ktr largest 
m-TUPLE problem are PP-complete under polynomial-time Turing reduc¬ 
tions. 

Proof. First, we prove membership in PP for both problems. Given an 
instance X = {xi, ..., x^} C M and 5, iF G N of the Ktr largest sub¬ 
set problem, we can constrnct a PP-TM M that hrst chooses every snb- 
set Y O X nniformly at random by selecting every Xi with probability 1/2. 
Snbsequently, M accepts if and only if Yhx&r ^ < B. It follows that the 
probability that M accepts is eqnal to 



Hence, #{P C X : < B} > K ii and only if M accepts with 

probability at least K/2^. By Lemma [H the Ktr largest subset problem 
is in PP. 

For the Ktr largest m-TUPLE problem, the constrnction is sim¬ 
ilar: a PP-TM M selects an m-tnple (xi,...,Xm) G Xi x ■ ■ • x Xm 
uniformly at random, i.e., each m-tuple with probability l/n^ 

By the characterisation of PP in Lemma [H this can be achieved by in¬ 
dependently choosing some Xi G Xi with probability for every 

1 < i < m. Then, M accepts if and only if Hence, 

{(xi,..., Xm) G Xi X • • • X Xm ■ > X if and only if M ac¬ 

cepts with probability at least iP/An application of Lemma [U 
shows that the Ktr largest tti-tuple problem is in PP as well. 

In order to show hardness for PP, we give polynomial-time Turing re¬ 
ductions from the canonical PP-complete problem MajSAT B 0. This 
problem asks, given a Boolean formula 'ip over n Boolean variables bi,. ■ ■ ,bn, 
does a majority (i.e., at least 2"'“^ -|- 1) of the variable assignments satisfy p). 
Since ^(i^SAT, i.e., the task of computing the number of satisfying assignments 
of a Boolean formula, is ^P-complete [1^, it follows from Proposition H] that 
there is a polynomial-time computable function h such that h{pp) outputs an 
instance X = {xi,..., Xm} and T G N of ^SubsetSum such that 



= # {(6i,..., M e {0, ir : ..., M = 1} • 
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Using binary search, with at most 2 • m queries to the ii'TH largest 
SUBSET problem we can compute 

M:=#iucX:^x<Tl and N := X :^x <T - I 

I xGT J I xGT 

We now have that L = M — N, and hence a majority of the variable assign¬ 
ments satishes '0 if and only if M — iV > 2”“^ -|- 1. 

In the same fashion, we can show hardness of the i^TH largest m- 
TUPLE problem. Set Xi := {0,2:^} for every 1 < i < m. As before, using 
binary search with at most 2 • m queries we can compute 



and 


{ lib 

(xi,. . . , Xm) e Xi X ■ ■ ■ X Xm : 'y^Xj > T 

= #iucX;5^x>Tl 
I xGF J 

{ m 

(xi,..., Xm) e Xi X • • • X ^ Xi > T -h 1 

= #iucX;5^x>T + ll. 

I x£Y j 


As above, L = M — N and a majority of the variable assignments satishes 
"0 if and only if M — X > 2*^“^ -|- 1, from which PP-hardness of the problem 
follows. □ 


3. Applications 

In this section, we discuss several problems whose NP-hardness has been 
shown in the literature by reductions from the Ath largest subset (or m- 
tuple) problem. By Theorem [3l all of these problems become PP-hard. As 
mentioned in Section [H Toda’s Theorem implies that if any of these problems 
were in NP (and hence NP-complete), the polynomial-time hierarchy would 
collapse to P^^. 
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3.1. The i^TH LARGEST-AREA CONVEX POLYGON Problem 


Chazelle ll| posed the following question: “[Gjiven n points in the Eu¬ 


clidean plane, how hard is it to compute the Kt\i largest-area convex polygon 
formed by any subset of the points?” This question was essentially answered 
by Salowe |l2|. In particular, the following decision variant was considered 


m 


/fTH LARGEST-AREA CONVEX POLYGON 

INSTANCE: A set P of n points (xi, X 2 ) G N x M, and A, P G N. 
QUESTION: Are there at least K subsets P' C P for which P' = 
extr{conv{P')) and the area of conv{P') is at least B ? 

Here, conv{P') refers to the convex hull of the points in P', and 
extr{conv{P')) refers to the set of extreme points in that convex hull. Sa¬ 
lowe [I^ showed that the Ath largest-area convex polygon problem 
is NP-hard, via a reduction from the Ath largest m-TUPLE problem. In 
fact, we have: 


Theorem 4. The Ath largest-area convex polygon problem is PP- 
complete under polynomial-time Turing reductions. 


Prooe. PP-hardness follows from the reduction in [12| combined with The¬ 
orem 131 Towards membership in PP, construct a PP-TM M, which, given 
an instance of the Ath largest-area convex polygon problem, selects 
a subset P' of P uniformly at random, and then checks (in polynomial time) 
whether P' = extr{conv{P')) and the area of conv{P') is at least B. If yes, 
M accepts, otherwise M rejects. So the acceptance probability is at least 
A/2"^ if and only if the given instance of the Ath largest-area con¬ 
vex POLYGON problem is a yes-instance. An application of Lemma [1] shows 
membership in PP. □ 


Salowe [ 1 ^ remarked that the corresponding enumeration problem is ^P- 
complete. From this observation it also follows that the Ath largest- 
area CONVEX POLYGON problem is PP-hard, although Salowe [l^ only 
claims NP-hardness. He also says “it is not clear that [the Ath largest- 
area CONVEX POLYGON problem] is in NP”. But this seems unlikely in view 
of Theorem 01 






3.2. The VERTEX counting Problem 

Let m, n G N with m > n. Let b G Q™ be a column vector and A G 
be a rational m x n matrix with its rows denoted by Oi,..., G Q"'. Then 
A and b define a polyhedron P := {x G M"' : A ■ x < b}. A vertex of P is a 
point that is the unique intersection of at least n of the bounding hyperplanes 
ai-x = 6j,iG{l,...,m}. The following problem was considered by Dyer (l3j |: 

VERTEX COUNTING 

INSTANCE: A matrix A G b G Q”, and K eN. 

QUESTION: Does the polyhedron A- x < b have at most K vertices ? 

Dyer [l^, Proposition 3] has shown that the vertex counting problem is 
NP-hard by a Turing reduction from the Ath largest subset problem. 
In fact, we have: 


Theorem 5. The vertex counting problem is PP-complete under 
polynomial-time Turing reductions. 


Proof. PP-hardness follows from the reduction in 13 combined with The¬ 


orem [31 To prove membership in PP, we use the fact that PP is closed under 
complement j^. So it suffices to prove PP-membership of the CO-vertex 
COUNTING problem, which is like the vertex counting problem except 
that it asks whether the polyhedron has at least K vertices. Construct a 
PP-TM M, which, given an instance of the CO-vertex counting prob¬ 
lem, selects n bounding hyperplanes, say hi,... ,hn, uniformly at random, 
and then checks whether they have a unique intersection. If no, M rejects. 
Otherwise M computes the unique intersection, say p, and checks whether 
p is in the polyhedron. If no, M rejects. Otherwise M computes the set S 
of all bounding hyperplanes on which p lies. Then M computes the lexi¬ 
cographically smallest u-element subset S' of S such that p is the unique 
intersection of the hyperplanes in S' . That set S' can be computed in poly¬ 
nomial time, e.g., by greedily adding hyperplanes with linearly independent 
normal vectors. If S' = {hi,..., hn}, then M accepts, otherwise M rejects. 
By construction, any vertex p is accepted by exactly one run of M. So the 
acceptance probability is at least K j if and only if the given instance 
of the GO-VERTEX COUNTING problem is a yes-instance. An application of 
Lemma [1] shows membership in PP. □ 
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3.3. Routing Flows with Delay Guarantees 


In the problem of routing flows through networks is considered, where 
the information available for making routing decisions is probabilistic. The 
aim of the routing algorithm is to guarantee certain quality-of-service re¬ 
quirements. One of the specihc problems is to evaluate the end-to-end delay 
of a given path, as we describe in the following. 

Let Cl, ..., be a path in a network. For each i G {1,. .., n}, the number 
di G Mo is a random delay introduced by the link e*. For all i G {1,... ,n} 
and k G Nq, the probability that di = k is given by Pi^k ^ [0,1] O Q. It is 
assumed that for each i there are only hnitely many k with pi^k > 0, and 
that the delays di are independent. The end-to-end delay is 
end-to-end delay requirement is given by a maximum delay D E No and a 


probability threshold r G [0,1 
problem as “problem P(7r 
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Guerin and Orda refer to the following 


DELAY GUARANTEE 

INSTANCE: A path length n G M, a list of all positive Pi^k ^ [0,1] OQ, 
the maximum delay D G No, a probability threshold r G [0,1] fl Q. 
QUESTION: Is the probability of Ym=i di < D at least r ? 

In H Lemma IV. 1] the delay guarantee problem was shown NP-hard 
by a reduction from the Ath largest subset problem. In fact, we have: 


Theorem 6. The delay guarantee problem is PP-complete under 
polynomial-time Turing reductions. 


Proof. PP-hardness follows from the reduction in [L^ combined with The¬ 
orem |3l Towards membership in PP, construct a PP-TM M, which, given 
an instance of the delay guarantee problem, takes independent ran¬ 
dom values for the di according to the given pi ^ and then checks whether 
< D. If yes, M accepts, otherwise M rejects. So the acceptance 
probability is at least r if and only if the given instance of the delay guar¬ 
antee problem is a yes-instance. An application of Lemma [ 1 ] shows mem¬ 
bership in PP. □ 


3 . 4 . Further Probabilistic Problems 

The Ath largest subset problem has been used by Laroussinie and 
Sproston [l^ to show NP-hardness and coNP-hardness of model-checking 
“durational probabilistic systems” (a variant of timed automata) against the 
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probabilistic and timed temporal logic PTCTL. Combining this rednction 
(which is similar to the one for the delay guarantee problem) with The- 


to 


orem [3] strengthens the NP-hardness and coNP-hardness resnlts of 15 
PP-hardness. 

Finally, the Kth largest m-TUPLE problem was nsed by Cire et ah [1^ 
to show NP-hardness of another probabilistic problem, namely determining 
whether a so-called “chance-alldifferent constraint” has a solntion. This is 
a weighted extension of stochastic constraint programming. Cire et ah (l^ 
remarked: “Also, we are not aware if the problem of deciding whether there 
exists a feasible solntion to chance-alldifferent is in NP.” Bnt this seems 
unlikely, as their rednction combined with Theorem [3] proves PP-hardness of 
this problem. 
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